Gate { a Tipster-based General Architecture for Text Engineering
نویسندگان
چکیده
Building on the work of the TIPSTER architecture group, the University of Sheeeld Natural Language Processing group have developed GATE, a General Architecture for Text Engineering. GATE implements the TIPSTER document manager, and adds a rich set of graphical tools for the management of modules and the data they produce and consume, and the visualisation of data. This paper classiies and reviews current approaches to software infrastructure for research, development and delivery of NLP systems. The task is motivated by a discussion of current trends in the eld of NLP and Language Engineering. The rest of the paper describes GATE and concludes with a status report on the system.
منابع مشابه
TIPSTER-Compatible Projects at Sheffield
Projects currently underway at Sheffield may be more appropriately described by the term Language Engineering than the well-established labels of Natural Language Processing or Computational Linguistics. This reflects an increased focus on viable applications of language technology, promoting a view of the software infrastructure as central to the development process. To this end, Sheffield has...
متن کاملLasie Jumps the Gate
The beneets of the eeective creation of Information Extraction (IE) in the last ten years, driven by the ARPA TIPSTER programme and the associated MUC evaluations, have been enormous, but it must now be time to ask what research issues face the systems we have built and what we should do next. We suggest that there are two classes of important research issues: those requiring detailed comparati...
متن کاملImplementing a Sense Tagger in a General Architecture for Text Engineering
We describe two systems: GATE (General Architecture for Text Engineering), an architecture to aid in the production and delivery of language engineering systems which significantly reduces development time and ease of reuse in such systems. We also describe a sense tagger which we implemented within the GATE architecture, and which achieves high accuracy (92% of all words in text to a broad sem...
متن کاملEllogon: A New Text Engineering Platform
This paper presents Ellogon, a multi-lingual, cross-platform, general-purpose text engineering environment. Ellogon was designed in order to aid both researchers in natural language processing, as well as companies that produce language engineering systems for the end-user. Ellogon provides a powerful TIPSTER-based infrastructure for managing, storing and exchanging textual data, embedding and ...
متن کاملTIPSTER Text Phase II Architecture Concept Version 1.1.1p 3 June 1996
The TIPSTER Architecture is a software architecture for providing Document Detection (i.e. Information Retrieval and Message Routing) and Information Extraction functions to text handling applications. The high level architecture is described in an Architecture Design Document. In May 1996, when the initial architecture design is complete, an Interface Control Document will be provided specifyi...
متن کامل